A High-Performance Implementation of Atomistic Spin Dynamics Simulations on x86 CPUs

نویسندگان

چکیده

Atomistic spin dynamics simulations provide valuable information about the energy spectrum of magnetic materials in different phases, allowing one to identify instabilities and nature their excitations. However, time cost evaluating dynamical correlation function $S(\mathbf{q}, t)$ increases quadratically as number spins $N$, leading significant computational effort, making simulation large systems very challenging. In this work, we propose use a highly optimized general matrix multiply (GEMM) subroutine calculate spin-spin that can achieve near-optimal hardware utilization. Furthermore, fuse element-wise operations calculation into in-house GEMM kernel, which results further performance improvements 44\% - 71\% on several relatively lattice sizes when compared implementation uses OpenBLAS, is state-of-the-art open source library for Basic Linear Algebra Subroutine (BLAS).

برای دانلود باید عضویت طلایی داشته باشید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

SSE Implementation of Multivariate PKCs on Modern x86 CPUs

Multivariate Public Key Cryptosystems (MPKCs) are often touted as future-proofing against Quantum Computers. It also has been known for efficiency compared to “traditional” alternatives. However, this advantage seems to erode with the increase of arithmetic resources in modern CPUs and improved algorithms, especially with respect to Elliptic Curve Cryptography (ECC). In this paper, we show that...

متن کامل

Atomistic spin model simulations of magnetic nanomaterials.

Atomistic modelling of magnetic materials provides unprecedented detail about the underlying physical processes that govern their macroscopic properties, and allows the simulation of complex effects such as surface anisotropy, ultrafast laser-induced spin dynamics, exchange bias, and microstructural effects. Here we present the key methods used in atomistic spin models which are then applied to...

متن کامل

Visualization in atomistic and spin simulations

Although I don't have a magic solution to the work involved in raising children (although I rather like my results and theirs), I can describe some good solutions for atomistic and spin visualization. Certain basic demands from visualiza-tion code are common to all atomistic or spin systems; some are also relevant to more general visualization needs. Many commercial and homegrown visualiza-tion...

متن کامل

MARSSx86: A Full System Simulator for x86 CPUs

We present MARSS, an open source, fast, full system simulation tool built on QEMU to support cycle-accurate simulation of superscalar homogeneous and heterogeneous multicore x86 processors. MARSS includes detailed models of coherent caches, interconnections, chipsets, memory and IO devices. MARSS simulates the execution of all software components in the system, including unmodified binaries of ...

متن کامل

High-performance Physics Simulations Using Multi-core CPUs and GPGPUs in a Volunteer Computing Context

This paper presents two conceptually simple methods for parallelizing a Parallel Tempering Monte Carlo simulation in a distributed volunteer computing context, where computers belonging to the general public are used. The first method uses conventional multi-threading. The second method uses CUDA, a graphics card computing system. Parallel Tempering is described, and challenges such as parallel...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Computer Physics Communications

سال: 2023

ISSN: ['1879-2944', '0010-4655']

DOI: https://doi.org/10.1016/j.cpc.2023.108851